Text Classification by PNN-based Term Re-weighting
Similar resources
Text Classification by PNN-based Term Re-weighting
Current approaches to feature selection for text classification aim to reduce the number of terms that are used to describe documents. Thus, documents can be classified and found with greater ease and precision. A key shortcoming of these approaches is that they select the topmost terms to describe documents after ranking all terms using a feature selection measure (scoring function). Lesser hi...
Imbalanced text classification: A term weighting approach
The natural distribution of textual data used in text classification is often imbalanced. Categories with fewer examples are under-represented, and their classifiers often perform far below a satisfactory level. We tackle this problem using a simple probability-based term weighting scheme to better distinguish documents in minor categories. This new scheme directly utilizes two critical information rati...
Efficient Text Classification Using Term Projection
In this paper, we propose an efficient text classification method using term projection. First, we use a modified χ² statistic to project terms into predefined categories, which is more efficient than other clustering methods. Afterwards, we use the generated clusters as features to represent the documents. The classification is then performed in a rule-based manner or via SVM. Expe...
Term Graph Model for Text Classification
Most existing text classification methods (and text mining methods at large) are based on representing the documents using the traditional vector space model. We argue that important information, such as the relationship among words, is lost. We propose a term graph model to represent not only the content of a document but also the relationship among the keywords. We demonstrate that the new mo...
Information extraction by text classification
Information extraction and text classification are usually seen as complementary forms of shallow text processing, in that they are aimed at very different tasks. In this paper, we describe two simple but real-world domains in which text classification techniques can be used directly for information extraction. Specifically, we describe systems for extracting information from business cards, an...
Journal
Journal title: International Journal of Computer Applications
Year: 2011
ISSN: 0975-8887
DOI: 10.5120/3701-5188